Collaborative knowledge distillation for incomplete multi-view action prediction
نویسندگان
چکیده
Predicting future actions is a key in visual understanding, surveillance, and human behavior analysis. Current methods for video-based prediction are primarily using single-view data, while the real world multiple cameras produced videos readily available, which may potentially benefit action tasks. However, it bring up new challenge: subjects more likely to be occluded by objects when captured from different angles, or suffer signal jittering transmission. To that end, this paper we propose novel student network called Collaborative Knowledge Distillation (CKD) predict with missing information under multi-view setting, i.e., incomplete prediction. First, create graph attention based teacher model capable of fusing video features task. Second, construct corruption pattern bank (CPB) simulate various segments video, each will manage one through privileged knowledge distillation. Third, account arbitrary real-world, ensemble models developed make joint The proposed framework has been extensively evaluated on popular datasets, including PKU-MMD NTU-RGB validate effectiveness our approach best not yet explored setting.
منابع مشابه
Modeling the knowledge-flow view for collaborative knowledge support
In knowledge-based organizations, workers need task-relevant knowledge and documents to support their task performance. A knowledge flow (KF) represents the flow of an individual’s or group members’ knowledge-needs and the referencing sequence of documents in the performance of tasks. Through knowledge flows, organizations can provide task-relevant knowledge to workers to fulfill their knowledg...
متن کاملOverview of View Synthesis Prediction for Multi-view Video Coding
With a wide range of viewing angles, the multi-view video can provide more realistic feeling at arbitrary viewpoints. However, because of a huge amount of data from multiple cameras, we need to develop an efficient coding method. One of the promising approaches for multi-view video coding is view synthesis prediction which generates an additional reference frame. In this paper, we explain the p...
متن کاملUnified subspace learning for incomplete and unlabeled multi-view data
Multi-view data with each view corresponding to a type of feature set are common in real world. Usually, previous multi-view learning methods assume complete views. However, multi-view data are often incomplete, namely some samples have incomplete feature sets. Besides, most data are unlabeled due to a large cost of manual annotation, which makes learning of such data a challenging problem. In ...
متن کاملView Interpolation Prediction for Multi-view Video Coding
Since multi-view video is a collection of videos captured by a multiple camera array, the volume of data is huge. Various algorithms have been developed for multi-view video coding. In this paper, we propose two methods: efficient view interpolation and ‘VIP P-picture’ coding. The view interpolation method includes initial disparity estimation, variable block-based matching, and pixel-level dis...
متن کاملHypercube Based Inter View Prediction for Multi-view Video Coding
Multi-view video coding (MVC) is being standardized by the MPEG committee. One of the important factors is improving the prediction efficiency by exploiting the spatio-temporal redundancies among the views. The prediction structure has to be selected to balance prediction efficiency as well as coding complexity. Increasing the number of reference views will improve the prediction but also incre...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Image and Vision Computing
سال: 2021
ISSN: ['0262-8856', '1872-8138']
DOI: https://doi.org/10.1016/j.imavis.2021.104111